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Abstract 

For two collections of nonnegative and suitably normalised -weights W — (Wj) and V = (Vn.k), a proba- 
bility distribution on the set of partitions of the set {1, . . . , n} is defined by assigning to a generic partition 
{^ii j < k} the probability Vn^k ^\Ai\ ' ■ ■ ^|Afc|j -where \Aj\ is the number of elements of Aj. We impose 
constraints on the -weights by assuming that the resulting random partitions n„ of [n] are consistent as n 
varies, meaning that they define an exchangeable partition of the set of all natural numbers. This implies 
that the -weights W must be of a very special form depending on a single parameter a e [— oo, 1]. The 
case a = 1 is trivial, and for each value of a ^ 1 the set of possible ^--weights is an infinite-dimensional 
simplex. We identify the extreme points of the simplex by solving the boundary problem for a generalised 
Stirling triangle. In particular, -we sho-w that the boundary is discrete for — oo < a < and continuous 
for < a < 1. For a < the extremes correspond to the members of the E-wens-Pitman family of 
random partitions indexed by {a, 9), -while for < a < 1 the extremes are obtained by conditioning an 
(a, 0)-partition on the asymptotics of the number of blocks of n„ as n tends to infinity. 
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Key-words: exchangeable partitions, E-wens-Pitman t-wo-parameter family, Gibbs distribution, generalised 
Stirling numbers 

1 Introduction 

By a random partition of the set of natural numbers N -we mean a consistent sequence 11 ~ (n„) of 
random partitions of finite sets [n] := {1, . . . , n}. For each n the range of the random variable n„ is the 
set of all partitions of [n] into some number of disjoint nonempty blocks, and the consistency means that 
n„ is obtained from n„+i by discarding the element n -I- 1. A random partition 11 is exchangeable if for 
each n the probability distribution of n„ is invariant under all permutations of [n] . 

Let {Aj^ 1 < i < A;} denote a generic partition of the set [n], and let the Aj be indexed by [k] in order 
of their least elements. Exchangeability of 11 means that 

P(n„ = {Ai,...,Ak})^p{\Ail...,\Ak\) 

for some nonnegative function 

p{\) :=p(Ai,...,Afc) 

of compositions A = (Ai, . . . , Afc) of n, such that p is symmetric in the arguments Ai, . . . , Afc for each fc, p 
is normalised by the condition p{l) = 1, and p satisfies the addition rule 

p(A)^ Mm) (1) 
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where the sum is over compositions fj, derived from A by either increasing a part by one or by appending 1 
at the end of the sequence A. For instance, if A = (3, 2,2), /i assumes the values (4, 2, 2), (3, 3, 2), (3, 2, 3) 
and (3, 2, 2, 1), and ^ speciahses to 

p(3, 2, 2) = p(4, 2, 2) + 2p(3, 3, 2) + p(3, 2, 2, 1). 

A function p with these properties is known as an exchangeable partition probability function (EPPF). Such 
a function uniquely determines the probability law of a corresponding exchangeable random partition 11. 

According to a Kingman's paintbox representation |22[ lUl ITU ] , every such exchangeable partition has 
the same distribution as 11 constructed from some random closed set Z C [0, 1], as follows: let ui, U2, . . . 
be independent uniform [0, 1] variables, independent of Z, and let distinct integers i and j belong to the 
same block of 11 if and only Ui and Uj fall in the same open interval component of [0, l]\Z. 

A distinguished class of exchangeable partitions is the two-parameter family, with EPPF 

p.,.(Ai, . . . , A.) := + - a).,._iT (2) 

where n = Xj and 

m 

are rising factorials, with the convention that {x)oip := 1. Possible values of the parameters {a, 9) are 
either —oo < a < and 9 = m\a\ for some m = 1, 2, . . . , oo ; or < a < 1 and 9 > —a, with a proper 
understanding of |5J in some limiting cases. See |2J for detailed exposition of the general theory of 
exchangeable partitions and features of the (a, 9) family. 

In this paper we are interested in a special class of Gibbs partitions, which generalise Q as follows: 

Definition 1 An exchangeable random partition 11 of the set of natural numbers is said to be of Gibbs 
form if for some nonnegative weights W = {Wj) and V = (V^,fc) the EPPF of 11 satisfies 

k 

p{Xi,...,Xk)= Vn,kY[Wx^ (3) 

for all 1 < fc < n and all compositions (Ai, . . . , Afe) of n. 

For fixed n we can choose arbitrary nonnegative weights Vi, . . . ,Vn and Wi , . . . , Wn which arc not 
identically zero, and use Q to define a random partition of [n] by setting Vn.k = Vk/cn for c„ a suitable 
normalisation constant (see (2^1 for another version of the Gibbs formalism). The block sizes of such a 
Gibbs partition can be realised by Kolchin's model, that is identified with the collection of terms of a 
random sum S = ^1 + . . ■+Xk conditioned on 5* = n, with independent identically distributed Xi, X2, ■ ■ ., 
independent of K. For integer weights this is the distribution on partitions of [n] induced by components 
of a random composite structure built over partitions of [n], when there are Wj possible configurations 
associated with every subset of [n] with j elements, Vk possible configurations associated with every 
collection of k blocks, and a uniform distribution is assigned to all possible composite structures subject 
to these constraints. For instance, if Wj = (j — 1)! and Vk = 9^ (with 6* G N), the product VkWx-^ ■ ■ ■ W\^ 
counts the number of coloured permutations of [n] with cycle sizes (Ai, . . . , Afc) and one of 9 possible 
colours assigned to each of the cycles: then (O reduces to Q with a — 0, 9 > 0. So in this case, there 
is an infinite exchangeble partition 11 whose restrictions n„ are all of the Gibbs form ||3Jl. Many other 
combinatorially interesting examples of Gibbs partitions n„ can be given, using the prescription (|2Jl for 
each fixed n: see for instance But typically the distributions of these combinatorially defined n„ 

are not consistent as n varies, so they are not realisable as the sequence of restrictions to [n] of an infinite 
Gibbs partition. 

The special case of l/-weights representable as ratios Vk,n = Vfc/c„ was studied by Kerov 13 in the 
framework of Kolchin's model. In this case one assumes a single infinite sequence of weights (14) and the 
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Cn's appear as normalisation constants. Kerov ^2] established that the Gibbs partitions of this type are 
precisely the members of the two-parameter family 

We will show that in the more general setting ||3J), allowing an arbitrary triangular array Vk.n, the W- 
weights must be still as in jSJ, with a single parameter a G [—00, 1] defining the type of Gibbs partition. 
The case a = 1 is trivial. For each nontrivial type a < 1 the set of all possible ^-weights is an infinite 
simplex Vq. We identify the extreme elements of Vq by solving a boundary problem for an instance 
of the generalised Stirling triangle, as introduced in another paper by Kerov ^21 (also see |14[ Chapter 
I]). It turns that the nature of the extremal set depends substantially on the type. According to our 
main result, stated more formally in Theorem II 21 there are three qualitatively different ranges of a. For 
a G [—00, 0[ the extremal set is discrete and corresponds to the members of the (a, 6')-family. For a — 
this set is continuous and still corresponds to the members of the (a, 6')-family (Ewens' partitions). For 
a g]0, 1[ the (a, 0)-partitions are not extreme, rather the extremes of Va comprise a continuous (ajs)- 
family (with parameter s G [0, 00]) which appears by conditioning the (a, 0)-partitions on the asymptotics 
of the number of blocks. In |21| the (a|s)-partitions were derived from their Kingman's representation, 
with the random closed set Z being the scaled range of an a-stable subordinator conditioned on its value 
at a fixed time. This identification of extreme elements of Vq for < a < 1 was indicated without proof 
in ini Theorem 8]. 

2 Some basic results 

To define an exchangeable partition the weights in (O are normalised by the condition 

n 

J2 Vn,kBn,kiW) = 1 for n - 1, 2, . . . 

fc=i 

where Bn,k is a partial Bell polynomial in the variables W — {Wi, W2, ■ ■ .), 

where the first sum expands over all partitions of [n] into k blocks, and the second over all compositions of 
n with k parts. Observe that there is a redundancy in the possible values of parameters: the distribution 
of the Gibbs partition is unaffected by simultaneous substitutions when either Wj — > ^^Wj and Vn^k — > 
7~"F„.fc, or Wj jWj and Vn^k l~^Vn,k for 7 > 0. Throughout we assume the normalisation 
Vi^i — Wi — 1. Granted the normalisation and excluding the trivial case of a partition 11 with only 
singleton blocks the ambiguity amounts to the geometric tilting Wj ^^^^Wj and Vn^k ^ l^^"'yn,k- 
Our starting point is the following elementary lemma. 

Lemina 2 The weights {Wj) and (Vn^k) with Wi — Vi.i define a partition of Gibbs form if and only if 
for some b > and a <b the following two conditions are satisfied: 

(i) 

W, = (6-a),_itb, J = 1,2,... (5) 

(ii) the Vn.k satisfy the recursion 

Vn,k = {bn - ak)Vn+i^k + K+i.fe+i , l<k<n. (6) 

Proof. The trivial singleton partition of N is of the Gibbs form with either Wj — l(j = 1) or Vn,k — 
l{k = n). Excluding the singleton partition, the Gibbs prescription forces Wj > for all j = 1,2, . . ., 
because p{n) = Vn,iWn and p{n) > for all n (as follows from Kingman's representation). 

The only Gibbs partition with V„^2 = for some n > 2 is the trivial one-block partition, in which 
case the conclusion is obvious. Excluding also this trivial case we may assume that Vn.2 > for all n and 
Wj > for all j. Introducing rj — Wj/Wj^i we find then that ^ amounts to 

fe 
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for all compositions of n with k parts. Applying this for k = 2 and using Vn,2 > we see that + rj 
depends only on i + j, hence r^+i — rj is constant and therefore {rj) is an arithmetic sequence 

rj=bj-a, j = l,2,... 

where necessarily 6 > and a < 6 to ensure Vj > (as entailed by Wj > 0). Now, from Wj = ri ■ ■ ■ Vj-i 
we obtain and © follows because 

fc k 
^rx, = h^\j-ak. 

Inverting the argument we see that p defined by |SJ, ® and lO satisfies the addition rule 1^. □ 



Two parameters a and h may be reduced by the geometric tilting to a single parameter a g [— c», 1], 
corresponding to the sequence of W^-weights 

Y/. = [ (1 " ")j-iT for - cx)< a < 1 , 
1^ 1 for a = — cx) . 

The case a = 1 corresponds to the trivial singleton partition and will be excluded from further consider- 
ation. 

Definition 3 For a < 1 let Va be the set of all distributions of infinite partitions 11 of type a, whose 
EPPF p is of the Gibbs form ^ with these W^- weights Q , and let Va be the set of nonnegative solutions 
V — (Vn^k) to the backward recursion 

Vn,k = ln,kVn+l,k + Vn+l,k+l , (8) 

with Vi^i = 1, where the coefficient for I < A; < n is given by 

( n — ak for — cx) < a < 1 , 

7n,fc S I, r (9) 
I k tor a — — oo . 

Lemma 12 establishes an afhne bijection between these two convex sets Va and Vq , hence also a 
bijection between the sets of their extreme points. To spell this out, each probability distribution P of 11 
with F ^ Va induces a distribution of Kn, the number of blocks of n„, according to the formula 

PiKn ^ k) ^ Vn,kBnAW) , 

obtained by summation of © over all partitions of [n] with k blocks. On the other hand, the conditional 
distribution of n„ given the number of blocks is 

p(n„ = {A,, A,}\K,, ^k) = ■ (10) 

Thus the distribution P of 11 determines the weights T4,,fc, and vice versa. Moreover, the weak topology 
on Va^ defined by pointwise convergence of EPPF's, corresponds in Vq to convergence of the Vn^k for all 
1 < fc < n. 

Observe that H10|l does not involve the F-weights. Thus for Gibbs partitions of a given type a, the 
sequence of block counts {Kn) is a sequence of sufficient statistics for (n„). In particular, for (a, 6')- 
partitions the sequence {Kn) is a Markov chain whose time-reversed transition probabilities are the same 
for all e. 

By general theory of sufficient statistics and extreme points [HI Eli each p £ Va can be uniquely 
represented as a convex mixture of the extreme elements of Vai and the same can be said of Va- 

Let 4'{a,9) := {(j)n,k{oi,0),l < k < n) denote the particular sequence of T^-weights appearing in the 
two-parameter formula 10), that is 

(l)n,k{a,e) ^ ——— iova^-oo (11) 

{U + ijn-lT 
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where 9 > a for < a < 1, and 9 = —\a\m, with m = 1, 2, . . . , oo for a < 0. In the case a — —oo the 
formula 

'/'n,fc(-oo,moo) ^ ("^)fci (^^2) 
m" 

is the a ~f —oo limit for m = 1, 2, . . . , oo, with m — oo corresponding to the trivial singleton partition. 
We have <j>{9,a) £ Vq, as can be readily checked by algebra. A characteristic property of this class of 
solutions is summarised in the following corollary: 

Corollary 4 |13l Theorem 7.1] Members of the two-parameter family, with EPPF given by are 
the only partitions of the Gibbs form with V -weights representable as ratios Vn.k — Vk/cn- 

Proof. Assuming > for some >c > 2 exchangeability implies Vk > for k < x and the recursion jnj 
becomes 

— -ti — ak — ^iii — hn , 1 < k < k . 
Vk c„ 

Since the left side does not involve n and the right side does not involve k, their common value is a 
constant, say t. Then Vk = (t -\- a)k^i^a, and because V2 > we have t > —a. In the case a < the 
requirement Vk > forces t — — ma for some integer m ^ \. By a similar argument Cn — ^" b)n—i^b- 
The form ^ follows by redundancy. □ 



3 The boundary problem 

To embed our discussion of extremes in a wider context let V be the convex set of nonnegative solutions 
V = {Vn,k) to the recursion 

Vi,i = 1; Vn,k = ln.kVn+l,k + Sn,kVn+lM+l , (1 < fc < (13) 

where the coefficients j„^k and Sn^k form two arbitrary triangular arrays of non-negative numbers. This 
recursion is associated with a generalised Pascal triangle, that is an infinite directed graph G with vertex 
set {{n,k) 1 < k < n}, such that each vertex (n, fc) has two immediate successors (n + l,k) and 
(n + + 1), the multiplicities of the outgoing edges being 7„^fc and 5n.k, respectively. For a directed 
path connecting the root (1, 1) and a node {n,k) we define the weight of the path to be the product of 
the multiplicities along the path, and let the dimension d"'^ be the sum of weights of all such paths, with 
the convention d^'^ = 1. The dimension is a unique solution to the forward recursion 

= <5„,fc-id"'^-i + 7«,fcd"''' (14) 

where 6n.o = 0. In the case of the standard Pascal triangle with "fn^k = 1, Sn,k = 1 this reduces by an 
obvious shift of indices to the familiar recursion for the binomial coefficients. 

Note that each path in G from (1, 1) to (n, k) may be written as a sequence ki, k2, . . . ,kn with ki = 1, 
kn ~ k and fcj+i — kj G {0, 1}. Consider now a random process (iiTn) with 1 < Kn < ri such that for 
every path from (1, 1) to (n, fc) the conditional probability 

P(Xi = 1, if2 = A:2, . . . , = fc„-i |i^„ = fc) 

equals the weight of this path divided by d"'*'. That is to say, the process (Kn) is Markovian with 
co-transition probabilities 



P(i^„_i=j|X„ = fc) 

where 



dn-\.,j qn-i{j,k) 



7„_ij if j = k 

q„_l(j, fc) = <( ^ri-lj if j = fc - 1 (15) 

else . 



The formula 

FviKn = fc) = Vn,kdn,k 
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establishes a bijection between V and the set of laws of such Markov chains. So we identify V with this 
set of Markovian laws. 

By some well known general theory j^EHSl^DEDi each extreme law in V can be represented as a weak 
limit of the conditional laws for {K„) given K^, = k^, for v ^ (X) and some sequence (>fy, v = 1,2,.. .). 
Following ^6|| we will call the set of these limit laws the boundary of G (sometimes also called 'Martin 
boundary' or 'maximal boundary' or 'the set of Boltzmann laws'). 

More explicitly, extending the above definitions, let the weight of a path in G connecting (n, k) and 
{ly, x) be the product of multiplicities along this path, and define the extended dimension dj'j^ to be the 
total weight of paths in G connecting (n, k) and {v, k) (which is zero unless k < x <v and v > n). Thus 
d^'^ = d"'*' and — 1. For each chain directed by some V the conditional law of {Ki, . . . , Kn~i) 
given — x (for > n) is the same, and is determined by 

^k\K, = x) = IC;- rf"'' > where V^^ := ^ • (16) 

Clearly, for {y^ x) fixed, V^^'^ satisfies (|13f) for n < v, thus if V^'^" converge as, v ^ oo along some 
infinite path {x^) then the limit is certainly in V. The infinite paths which induce the limits are called 
regular, and the set of such limit elements of V is the boundary of G. For regular path {x^) we say that 
the boundary element V = lim^_>oo 1/"'^" and the corresponding law Py are induced by the path. 
The next lemma is adapted from El • 

Lemma 5 Identifying the elements ofV with the laws Py for Markov chain (Kn) we have: 

(i) each extreme element V £ V belongs to the boundary of G, that is may be represented as a limit of 
the functions V^'," along some regular path {x^), 

(ii) for every G V, under Py almost all paths of (Kn) are regular, 

(iii) a solution V £ V is extreme iff the set of regular paths which induce V has Py -probability one. 

Example. The instance of the boundary problem for the standard Pascal triangle has been treated by 
many authors. In this case it is more convenient to label the nodes by nonnegative integers {(n, fc), < 
k < n}. The corresponding chains < Kn < n are those whose increments Kn+i — Kn are exchangeable 
random variables with values in {0, 1}. The dimension function is given by the binomial coefficients 
'^" '^ ~ (fc) ' fc ^ (^-fe)- ^ path (xi,) is regular if and only if there is a limit x^/i/ s for some 
s G [0, 1], which corresponds to a boundary element V{s) with Vn,k{s) = s'^(l — s)"~'"', hence the boundary 
is homeomorphic to [0, 1]. This is dc Finetti's representation of infinite exchangeable sequences of zeros 
and ones. Since Kn under Py(s) is the number of successes in a series of n Bernoulli trials with success 
probability s, the law of large numbers ensures Py(^s){Kn/n — > s) = 1. Hence each V{s) is extreme by 
Lemma El (iii). In fact, to ensure regularity of a path we only need to check the convergence of V^'^'q" 
for each n, because the bivariate array {Vn^k) satisfies the backward Pascal recursion if each Vn^k is a 
finite difference of the sequence (Kt,o)- It follows that a sequence (Ki,o) with Vnft = 1 is representable 
as a convex mixture of functions Ki,o(s) = (1 — s)", s G [0, 1] if and only if the associated array (Ki,fc) 
is nonnegative, in which case such representation is unique. The last assertion is widely known as the 
resolution of the Hausdorff problem of moments. 

In general, however, the set of extremes (sometimes called the 'minimal' boundary) may be smaller than 
the boundary. This kind of pathology is illustrated by the following example. 

Example Consider a graph G with the following sets of nodes and edges. Level has a single node 0, 
which is the root of G. Level 1 has two nodes ai and ci, and the root is connected to the nodes ai and 
ci. Level 2 has three nodes a2,&2,C2, so that ai is connected to 02 and &2, while ci is connected to 62 
and C2- On each further level n > 2 there are exactly 3 nodes a„, 6„, c„. Node a„-_i is connected to a„ 
and bn, node &„_i is connected only to 6„, and c„_i is connected to 6„ and c„. There are no other edges 
and the edges just described all have multiplicity 1. 
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Every infinite path in G starting at is regular. The boundary of G consists of three elements 14, Vb 
and Vc, induced by the paths a = (0, ai, a2, ■ ■ ■), b = (0, 6i, 62, • ■ ■), and c = (0, ci, C2, . . .), respectively. 
Clearly, Va is a unit mass at a and Vc is a unit mass at c. Observe that for > n > 2 there are v — n 
paths from a„ to , the same number v — n oi paths from c„ to b^ , and there is only one path connecting 
bn and b^- Sending — > 00 we see that Vb is the mixture Vb = 14/2 + K/2. 

It follows that the boundary {Vq, Vh, Vc} is larger than the set of extremes {Va, Vc}. The distribution 
Vb violates the condition in Lemma |S1 (iii): though Vb can be induced by many paths (unlike Vq and Vc), 
the set of these paths has Vf,-probability zero. 

Further examples may be related to other classical number triangles and their generalisations (as in 
|18[ I26p although explicit results on the boundary problem are scarce. For later application we record 
some useful general tools. 

Obviously, Vn,k ~ implies V^.,t = for ly > ji, >c > k. In particular, the trivial solution with Py (i4r„ = 
n) = 1 is characterised by V2.1 = 0, and another trivial solution with ¥v{Kn = 1) is characterised by 
V2.2 = 0. Both trivial solutions are extreme. 

Lemma 6 A path (x^) is regular if and only ifV^^'i" converge as v 00 for each n. Every path with 
V^'x" is regular and induces the trivial solution with V2.1 = 0. 

Proof. Because Vn+i^k+i = iVn,k — Jn,kVn+i,k)/Sn.k , a double induction, first in k and then in n, shows 
that V € V is uniquely determined by the entries (V„^i). □ 

The next lemma expresses a well known stochastic monotonicity property of the kind of inhomogcnous 
positive integer-valued Markov chains involved here. We indicate an algebraic proof, but it can also be 
derived probabilistically by a coupling argument. See and papers cited there. 

Lemma 7 For v > n fixed, V^'i is nonincreasing in k. 

Proof. The proof is by induction in v. Suppose the claim is true for some v, then for fixed 1 < >c < n 
and nonnegative a, (3, 7, 5 with q; + /3 = 1, 7 + 5=1 we have 

(where V^'^ ~ Vn'i'^^ = 0)- For a suitable choice of a,/?, 7, (5 the left side of the inequality equals 
d^'^V^\^''^ while the right side equals o?"'^l^^^'^^^, as follows readily from H16|l . The induction step 
follows. □ 

Lemma 8 Suppose for m — 1,2,... there are solutions V(m) G V such that Vn,m{^)d"""^ 1 as 
n — > 00, then each V(m) is extreme and satisfies 



V(m) 



f lim i4:„ = = 1 . (17) 



If above that V2.i(m) as m 00 then V(m) converges to the trivial law V(oo) with Py(oo)(i4'„ = 
n) = 1, and in this case the set of extreme elements of V is {V(l), V(2), . . . , V(oo)}. 

Proof. All paths (x^) are nondecreasing, thus P^/(,„)(_?^„ > m) = 0, and because 

Pv(„) ( lim Kn^m] > Vv(m){Kn = rn) = Vn,m{m) d"'™ ~> 1 , as n 00 

we have H17|l . Easily from LemmaEl V(m) is extreme and can be induced by arbitrary path with ^ m 
for large enough ly. 

Now let x^ 1 00. By Lemma Eland the above argument we have for v 00 

^2,1 < ^2,1 V2.i{m) , 
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hence letting m oo and invoking Lemma El shows that [xu) induces y(oo). Since every path has either 
finite or infinite limit, every path is regular and the list of extremes is complete. □ 



Example A discrete family of solutions with the properties as in Lemma |S1 exists for a graph called 
'the q-Pascal triangle'. The set of nodes of the graph is {(n,j),0 < j < n}, the multiplicities are 
1n,k — 'z'^"^, Sn,k = 1, and the dimension function is given by the g-binomial coefhcients. The boundary 
has been determined in |12l I19| . 



Our main tool for identifying the boundary in the continuous case is the following lemma. Compare 
with where the same method is applied to obtain a different generalisation of de Finetti's theorem 
for sequences of zeros and ones, and see |2] for another closely related setting. 

Lemma 9 Suppose there is a sequence of positive constants (c„) with c„ oo, and for each s G [0, oo] 
there is a solution V{s) G V which satisfies 

Pv{s) ( lim K,/c, = s)=l. (18) 

Suppose the mapping s i— > V{s) is a continuous injection from [0, oo] to V with and oo corresponding 
to the trivial solutions 

Pv(0) (i^„ = 1) = 1 , Py(oo) {Kn = n) = 1 . 

Then 

(i) a path {xu) is regular if and only if lim,y^oo Xvjcy = s for some s G [0,oo], in which case {x^) 
induces V{s), 

(ii) {V{s),s e [0, oo]} is the set of extreme elements ofV. 

Proof, (i) Let {x,^) be a path with >c^/c„ s for some < s < oo. Using the fact that for all y G V the 
co-transition probabilities are the same, and exploiting the monotonicity, as in LemmaUJ we can squeeze 

Pv{s~2e)iKn = l\K,/Cu <S-€)> d"' V,r;r > ^V{s+2,){Kn = l\K,/c^ > s + e) 

for e < s/2 and v sufficiently large. From this and the assumption 1)18(1 we derive 

Vn,i{s-2t) > V^^:^" > V{s + 2e) 

for large v. Letting e — > and using the assumed continuity we conclude that (x^) is regular and induces 
V{s). The cases s = and s = oo are treated similarly. By the same argument, a path (x^) cannot be 
regular if Knj Cn has distinct subsequence limits. 

(ii) Follows from (i), (|18|l and Lemma El (iii). □ 



The lemma is designed to cover normalisations c„ = o(n). Compare this with the standard Pascal 
triangle, where we assume the scaling by c„ = n that leads to parameterisation of the boundary by [0, 1]. 



4 Stirling triangles 



A generalised Stirling triangle, as introduced by Kerov [12) . is a generalised Pascal graph G with multi- 
plicities of the form 'yn,k = + ifc , and Sn^k = 1- We will consider the boundary problem in the special 
case with coefficients ^ where a € [— oo,l[. In this case the dimension d"''^ is a generalised Stirling 

which may be defined in many ways. For instance, it is determined by the recursion H14|) . 



number 



or by specialising the Bell polynomial Bn.k for weights Q), or as the connection coefficient in 



(^)t 



E 

k=l 



{x)kTa (for a ^ -oo) , 
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or as coefficient at a;" in the series expansion of 



(i-(i-^r)* 



(a ^ 0, — oo) 



(19) 



For a = —oo these are the Stirhng numbers of the second kind, for a = the signless Stirling numbers 
of the first kind and for a = —1 the Lah numbers. Alternatively, by the definition of dimension d" 
the sum of weights we obtain 



5Z n n ('^ ^ ' for a > -oo 

l=no<"l<---<"fc-l<n&=n j = l nj-i<i^<nj 

k 



oo l=no<rii<...<r!fc_i<nfc=n j = l 

The extended dimension satisfies a recursion similar to 114|l . from which we find 



V — n + 1 



for a > — oo . 



as 

(20) 
(21) 

(22) 



ka—n+l 



A similar formula for a = — oo requires a further generalisation of Stirling numbers as in [77l| . To stress 
dependence on the parameter a, we shall denote the generalised Stirling triangle by Ga and denote by 
Va the set of nonnegative solutions to 

By H22|) . identifying the boundary of Ga is equivalent to finding the limiting regimes for which 
entail convergence of certain ratios of the generalised Stirling numbers. By Lemma El this is reduced to 
the analysis of possible limits for V^'^ = d'^l/d"''^ . This line seems difficult to pursue, because it requires 
asymptotics of Stirling numbers of distinct types. Still, there is a much better formula which involves 
Stirling numbers of a single type: 



Lemma 10 For a ^ -oo and >n>l, ly > >c>l 



where d"'^ 



(1 - a)n-i 



1 



E 



V — n 
j - n 



(1 - a)j-iT 



(23) 



Proof. The left side is the conditional probability of Kn = 1 given Ki, = which is common for all 
V ^ Va- On the other hand, for partition of the Gibbs form the probability that 11^ has k blocks and 
the set [n] falls completely in one of the blocks is 



v — n 
j - n 



and to obtain the conditional probability we should divide this by the probability V^,~.cB^,^{W) for m: 
blocks. Specialising the weights we arrive at (|23|l . □ 

The extension of (|23|l to the case a = — oo is obvious. The identification of the extremes of Va breaks 
naturally into cases. 



4.1 Case — oo < a < 

We claim that the conditions of Lemma|Slhold with V{m) = (p{a,m\a\). The lemma requires that 



(m|a|) 



(24) 
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goes to 1 as n ^ cxD which seems difficult to check directly. Only in the case a = — 1 this is straightforward 
due to the handy formula for Lah numbers 



n - 
m 



1 



7,1 



1 / to! 



Still, (|17|l follows trivially from Kingman's representation. In this case the set Z divides [0, 1] into 
TO intervals of random sizes distributed according to the symmetric Dirichlet density proportional to 
(Ci ■ ■ ■ fm)'"'""'^ on the simplex Sj^j = 1. We have also V{m) — * V{oo) = (/)(q!, ooa) with Vn,k{oo) = 
l(n = k). Thus by Lemma|Hl the set of extremes is {0(a, TO|a|), m = 1,2,..., oo}. As a by-product we 
have shown that the right side of (|24|l approaches 1 as n ^ oo. 

Another consequence is the following analogue of the HausdorfF moments problem. To interpret Vn,k 
as a generalised {k — l)th-order difference of the sequence consider the difference operator 

which tranforms a sequence (u„) into another sequence 

(Aqu)„ = u„ - (n + 1 - a)un+i ■ 

For V = {Vn^k) solving ©, setting (u„) = {Vn,i) we have 

Vn,k = (A(fc_i)c(. . . (A2a(AQu) . . .)„_fc+l . (25) 

Note that except for a = the operators Ajq for different j do not commute. 
Corollary 11 Choose a <Q. A sequence {un) with iti = 1 can he represented as 

oo 

EQm 
, {m\a\ + l)„_iT 

for some probability distribution q on {1, 2, . . . , oo} if and only if the array (Ki,fc) computed by (|25|l with 
(un) — (Vn^i) is nonnegative. 

Another consequence is the asymptotics which does not seem obvious analytically: 



lim 

y — »oo 



V — 


n H 


- 1 ■ 




TO - 


-k- 


hi 


ka-n+\ |a|''(TO)fcJ, 




V 




(|a|TO)„-r 




TO 







for fc = 1, 



4.2 Case a = —oo 

This is the limiting case for a 



oo. Lemma IHl holds with solutions 

im)ki 



Vn^kim) := 0„,fc(-oo, ooto) 



for fc = 1, . 



The conditions are checked as in the previous case, using the fact that (/>(— oo, ooto) corresponds to the 
elementary coupon-collecting partition derived from a sequence of independent random variables with 
uniform distribution on to possible values. 



4.3 Case a = 

That the extremes correspond to the Ewens family can be seen from Lemma |31 Take c„ = log n and 
recall the well known law of large numbers: that for (0, 9) partition K„ ^ dlogn a.s. [3]. The fact follows 
from the representation of Kn as a sum -I- . . . + ^„ of independent Bernoulli random variables with 
success probability {0 + j — 1)""^. 
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We want to see how the Ewens family emerges from the asymptotics of Stirhng numbers, making 
sense of the statement 'ESF(6') conditioned on Kn ~ slogn is ESF(s)'. For the Stirhng numbers of the 
first kind there is an asymptotic formula 



^^""^ (log.)- 



r(x)r(i + s) 



valid for >c ^ slogi^, as — > oo uniformly in s bounded away from and oo. Assuming this regime for 
>f some calculus shows that (|23|) is asymptotic to a Riemann sum for the integral 



(s + l)„-iT ' 

It is seen that a path is regular if and only if Xu/logv s for some s G [0, oo] and that the solution 
induced by such a path has Vn,i{s) = (j>n,i{0,s), hence V{s) = (/)(0,s). This identifies the boundary of 
Go, but it is not clear, by this approach, how to show that all solutions 0(0, 9) are extreme (which follows 
from the above law of large numbers). 

Thus every Gibbs partition with Wj = {j — 1)1 is a unique mixture of Ewens' partitions: 



/>oo 

/ PQ.eQidO) , peVo 
Jo 



for some probability distribution Q on [0,cxd]. This result was conjectured in |12l I14| . and also stated 
without proof in attributed to Frank Kelly. The analogue of CoroUarvIl 1 1 holds with kernel 9^ /{9)n^. 

4A Case < a < 1 

This is the most interesting case. LemmaElis applied in this case with V{s) — ^p{a\s), s £ [0, oo], the 
law for (Kn) derived from an {a, 9) partition conditioned on Kn ~ sn". This is the partition derived 
by sampling from a Poisson-Kingman random discrete distribution denoted in [211 §5.3] by PK(/9Q.|t) for 
t = s~". Here we pursue the connection with Stirling asymptotics. 

Let ga be the density of the Mittag-Leffler distribution, which is determined by the moments 

T(Pa + 1) 



For k ^ sn", n — > oo there is the asymptotic formula 

■n'" a'^''' ga{s) 



r(") „l-fe 



m 



which holds uniformly is s bounded away from and oo. The formula was derived in |25l Section 5] from 
a local limit theorem for the stable density (note that these Stirling numbers are those of multiplied 
by a^'^). In .^1 Theorem 11] the formula was concluded by the singularity analysis of the generating 
function (jl9|l . and in "H", Corollary 12] the formula appeared in connection with an urn model similar to 
that in |?7] . 

Substituting this approximation into H23|l . using r(i/ + j3)/T{i') ^ and introducing the variable 
y = j /ly we arrive at a Riemann sum for 



K,i(s) (1 - a)n-n = , , I ' y"-i-"(l - y)-^-"5a(s(l ~ y)-") dy. (26) 

r(l - a)ga{s) 7o 

Using the change of variable t — s~^/" and the formula 

/a(2/) = ay~^~"<?a(y~") 

connecting g^ to the stable density fa, whose Laplace transform at A is exp(— A"), we get 

VnAs) (1 - «).-rr = f y-^-V.m - y))dy , 
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which is an instance of |21l Equation (66)] . It follows that a path is regular if and only if Xv/v" converges, 
in which case the induced solution is ^{a\s). This identifies the corresponding partition as a Poisson- 
Kingman partition, as indicated above. 

Thus the Stirling asymptotics enable to determine the boundary of Ga , but we do not see how they 
imply that all boundary solutions are extreme. But this is obvious from the law of large numbers: 
because for (a, 0)-partition Kn/rf^ has a strong limit, the same is true for the {a\s) partition, obtained 
by conditioning. The paintbox Z for the (a|s) partition can be represented by the division of [0, 1] into 
excursions of a Bessel bridge (or Bessel process) conditioned on its local time at zero being equal to s. 

We summarise conclusions of this section in the following theorem, which is our main result. 

Theorem 12 Each Gibbs partition of fixed type a G [— oo,l[ is a unique probability mixture of the 
extreme partitions of this type, which are 

(i) (a, \a\m) -partitions with ni — 0,1, ... , oo, for a £ [— oo, 0[ ; 

(ii) the Ewens (0, 6) -partitions with 6 £ [0, oo], for a = ; 

(iii) the Pois son- Kingman (a\s)- partitions with s G [0, oo], for a G ]0, 1[ . 

Acknowledgement We are indebted for Philippe Flajolet and Grigori Olshanski for alerting us of the 
connections with their work. 

References 

[1] D.J. Aldous. Exchangeability and related topics. In Ecole d'ete de probabilites de Saint-Flour, 
XIII— 1983, volume 1117 of Lecture Notes in Math., pages 1-198. Springer, Berlin, 1985. MR 883646 

[2] D.J. Aldous. Tail behavior of birth-and-death and stochastically monotone processes, Probab. Th. 
Related Fields 62(3): 375-394, 1983. MR 0688645 

[3] R. Arratia, A. D. Barbour, and S. Tavare. Logarithmic combinatorial structures: a probabilistic 
approach. EMS Monographs in Mathematics. European Mathematical Society (EMS), Ziirich, 2003. 
|MR 2032426 

[4] A. Borodin and G. Olshanski. Harmonic functions of multiplicative graphs and interpolation poly- 
nomials. Electronic J. Combinatorics, 7: paper #R28, 2000. MR 1758654 

[5] P. Diaconis and D. Freednian. Partial exchangeability and sufhciency. In J. K. Ghosh and J. Roy, 
editors. Statistics Applications and New Directions; Proceedings of the Indian Statistical Institute 
Golden Jubilee International Conference; Sankhya A. Indian Statistical Institute, 205-236, 1984. 
MR 786142 

[6] E.B. Dynkin. Sufficient statistics and extreme points. Ann. Probability, 6:705-730, 1978. IMR 5183211 

[7] C. Banderier, P. Flajolet, G. Schaeffer and M. Soria Random maps, coalescing saddles, singularity 
analysis and Airy phenomena. Random Str. Alg., 19: 194-246, 2001. MR 1871555 

[8] P. Flajolet, J. Gabarro and H. Pekari, Analytic urns. To appear in Ann. Probab. arXiv:PR/0407098 

[9] A. V. Gnedin. The representation of composition structures. Ann. Probab., 25(3):1437-1450, 1997. 
IMU. 14576251 

[10] A. Gnedin and J. Pitman. Regenerative composition structures. To appear in Ann. Probab. 
|arXiv:PR/0407098| 

[11] J.G. Kemeny, J.L. Snell and A.W. Knopp Denumerable Markov chains Springer, NY, 1976. 
IMR 407981 



12 



[12] S. V. Kerov. Combinatorial examples in the theory of AF-algebras. Zap. Nauchn. Sem. Leningrad. 
Otdel. Mat. Inst. Steklov. (LOMI), 172 (Differentsialnaya Geom. Gruppy Li i Mekh. Vol. 10):55-67, 
1989. Translated in J. Soviet Math. 59(5): 1063-1071, 1992. 169-170, 1989. MR 1015698 

[13] S. Kerov. Coherent random allocations and the Ewens-Pitman sampling formula. PDMI Preprint, 
Steklov Math. Institute, St. Petersburg (1995) 

[14] S. Kerov. Asymptotic representation theory of the symmetric group and its applications in analysis, 
Amer. Math. Soc., Providence, RI, 2003. MR 19848681 

[15] S. Kerov. The boundary of Young lattice and random Young tableaux. DIM ACS Ser. Discr. Math. 
Theor. Comp. Sci., 24:133-158, Amer. Math. Soc. Providence, RI, 1996. MR 13 63510 

[16] S. Kerov, A. Okounkov and G. Olshanski. The boundary of the Young graph with Jack edge 
multiplicities. Internat. Math. Res. Notices, (4):173-199, 1998. MR 1609628 

[17] J.F.C. Kingman. The Mathematics of Genetic Diversity, SIAM, 1980. MR 591166 

[18] G. Labelle, P. Leroux, E. Pergola and R. Pinzani. Stirling numbers interpolation using permutations 
with forbidden sequences. Discrete Math. 246 (2002), no. 1-3, 177-195. MR 1887485 

[19] G. Olshanski, g-Pascal triangle and g- Young, November 2001, unpublished notes. 

[20] J. Pitman. Exchangeable and partially exchangeable random partitions. Probab. Th. Rel. Fields, 
102:145-158, 1995. IMR 13372491 

[21] J. Pitman. Poisson-Kingman partitions. In D.R. Goldstein, editor. Science and Statistics: A 
Festschrift for Terry Speed, volume 30 of Lecture Notes - Monograph Series, pages 1-34. Institute 
of Mathematical Statistics, Hayward, Cahfornia, 2003. MR__2004330 

[22] J. Pitman. Partition structures derived from Brownian motion and stable subordinators. Bernoulli, 
3:79-96, 1997. MR 1466546 

[23] J. Pitman. An extension of de Finetti's theorem. Advances in Applied Probability, 10: 268-270, 1978. 

[24] J. Pitman. Combinatorial stochastic processes. Lecture notes for St. Flour 

course, July 2002. Technical Report no. 621 Dept. Statistics, U.C. Berkeley 
|http : / / stat-www . be rkeley . edu/tech- report s/| To appear in Springer Lecture Notes in 
Mathematics. 

[25] J. Pitman. Brownian motion, bridge, excursion and meander characterized by sampling at indepen- 
dent uniform times. Electronic Journal of Probability, 4:1-33 (Paper no. 11), 1999. MR 1690315 

[26] A. Regev and Y. Roichman Statistics of wreath products and generalized Bernoulli-Stirling numbers. 
Preprint 2004, |arXiv:math.CO /0404354| 

[27] E.G. Tsylova, Probabilistic methods for obtaining asymptotic formulas for generalized Stirling 
numbers, in Statistical estimation and hypothesis testing methods (Russian) (Perm! , 1990), 165-178, 
Perm. Cos. Univ., Perm'. Translated in J. Math. Sci., 75: 1995, 1607-1614. MR 1253585 

[28] E. G. Tsylova, The asymptotic behavior of generalized Stirling numbers, in Combinatorial- algebraic 
methods in applied mathematics (Russian), 143-154, 158, Gor'kov. Cos. Univ., Gorki. MR 090428^ 

[29] A. M. Vershik, Statistical mechanics of combinatorial partitions, and their limit shapes Funct. Anal. 
Appl. 30: 90-105, 1996. IMR, 1402(1791 



13 



